Email Categorization with Tournament Methods
نویسندگان
چکیده
We propose to deploy the tournament methods to perform the task of email categorization. The multi-class categorization process is thus broken down into a set of binary classification tasks. We implement the elimination tournament and Round Robin tournament and apply them to classify emails within 15 folders. We conduct substantial experiments to compare the effectiveness and robustness of the tournament methods against the n-way classification method. The experimental results prove that the tournament methods outperform the n-way method by 11.7% regarding precision and the Round Robin performs slightly better than the Elimination tournament on average.
منابع مشابه
Improving Quality of Email Categorization with Tournament Methods
The tournament text classification methods are proposed in this article to perform the task of email categorization, in which the essence is to break down the multi-class categorization process into a set of binary classification tasks. We implement the methods of elimination and Round Robin tournament to classify emails within 15 folders. Substantial experiments are conducted to compare the ef...
متن کاملChoosing from a large tournament
A tournament can be viewed as a majority preference relation without ties on a set of alternatives. In this way, voting rules based on majority comparisons are equivalent to methods of choosing from a tournament. We consider the size of several of these tournament solutions in tournaments with a large but finite number of alternatives. Our main result is that with probability approaching one, t...
متن کاملPerformance Responses to Competition Across Skill-Levels in Rank Order Tournaments: Field Evidence and Implications for Tournament Design
Tournaments are widely used in the economy to organize production and innovation. We study individual contestant-level data on 2796 contestants in 774 software algorithm design contests with random assignment. Precisely conforming to theory predictions, the performance response to added contestants varies non-monotonically across contestants of different abilities; most respond negatively to co...
متن کاملAn Approach to Email Categorization with the ME Model
This paper puts forward a hierarchical approach for categorizing emails with the ME model based on its contents and properties. This approach categorizes emails in a two-phase way. First, it divides emails into two sets: legitimate set and Spam set; then it categorizes emails in two different sets with different feature selection methods. In addition, the pre-processing, the construction of fea...
متن کاملAutomatic Categorization of Email into Folders: Benchmark Experiments on Enron and SRI Corpora
Office workers everywhere are drowning in email—not only spam, but also large quantities of legitimate email to be read and organized for browsing. Although there have been extensive investigations of automatic document categorization, email gives rise to a number of unique challenges, and there has been relatively little study of classifying email into folders. This paper presents an extensive...
متن کامل